Structured Prediction with Perceptron: Theory and Algorithms

نویسنده

  • Kai Zhao
چکیده

Structured prediction problem is a special case of machine learning problem where both the inputs and outputs are structures such as sequences, trees, and graphs, rather than plain single labels or values. Many important natural language processing (NLP) problems are structured prediction problems, including Part-Of-Speech tagging, parsing, and machine translation. This survey investigates how perceptron, the simplest supervised machine learning algorithm, can be adapted to handle structured inputs and outputs. In particular, for the estimation problem that searches for the right output structure, introducing structures leads to combinatorial explosion and the inference becomes extremely time-consuming. Various dynamic programming techniques and approximations are developed to speed up the search. For the learning problem, especially supervised learning with perceptron, searching structures with approximations makes the standard perceptron learning algorithm unconvergable. To address this problem, new learning methods are invented for structured learning. In addition, complicated structured prediction problems usually involve unobserved structures as hidden variables. A latent variable perceptron model is also discussed to handle those problems.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Structured Perceptron with Inexact Search

Most existing theory of structured prediction assumes exact inference, which is often intractable in many practical problems. This leads to the routine use of approximate inference such as beam search but there is not much theory behind it. Based on the structured perceptron, we propose a general framework of “violation-fixing” perceptrons for inexact search with a theoretical guarantee for con...

متن کامل

Online Learning for Inexact Hypergraph Search

Online learning algorithms like the perceptron are widely used for structured prediction tasks. For sequential search problems, like left-to-right tagging and parsing, beam search has been successfully combined with perceptron variants that accommodate search errors (Collins and Roark, 2004; Huang et al., 2012). However, perceptron training with inexact search is less studied for bottom-up pars...

متن کامل

Performance Comparison of Training Algorithms for Semi-Supervised Discriminative Language Modeling

Discriminative language modeling (DLM) has been shown to improve the accuracy of automatic speech recognition (ASR) systems, but it requires large amounts of both acoustic and text data for training. One way to overcome this is to use simulated hypotheses instead of real hypotheses for training, which is called semisupervised training. In this study, we compare six different perceptron algorith...

متن کامل

Dual Coordinate Descent Algorithms for Efficient Large Margin Structured Prediction

Due to the nature of complex NLP problems, structured prediction algorithms have been important modeling tools for a wide range of tasks. While there exists evidence showing that linear Structural Support Vector Machine (SSVM) algorithm performs better than structured Perceptron, the SSVM algorithm is still less frequently chosen in the NLP community because of its relatively slow training spee...

متن کامل

Optimal Shift-Reduce Constituent Parsing with Structured Perceptron

We present a constituent shift-reduce parser with a structured perceptron that finds the optimal parse in a practical runtime. The key ideas are new feature templates that facilitate state merging of dynamic programming and A* search. Our system achieves 91.1 F1 on a standard English experiment, a level which cannot be reached by other beam-based systems even with large beam sizes.1

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014